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(54) Distributed client- based data caching system and method 

(57) A system and method for enabling data pack- 
age distribution to be performed by a plurality of peer cli- 
ents connected to each other through a network, such 
as a LAN (local area network). Each peer client can 
obtain data packages from each other or from an exter- 
nal server. However, each peer client preferably obtains 
data packages from other peer clients, rather than 
obtaining data packages from the external server. 
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Description 

m«n, , oroc^nt invention relates to a distributed client-based data caching system and method. Specifically, the 

and ierated through a remote computer. know, as «" 
client computer is also often termed only a cl.ent ). The a^mage ot sucn a y 

potentially obtain data from any server on the network. The f'^^^^™ » the load is not 

bandwidth on the netwo^^ requests , thereby 

^nt^ 

retrieval due to the heavy demands placed upon the available b«J^- connections to 

sr. »^=SE^==»-55^ 

separately. Thus. peoay seevees law* "^"V a ™^* « m, beeislte d Ihe proy seevee. ««" '«*"<> 

[00071 According to the present invention, there £ proved a ™^J**™™ being a dedi- 

5e network featuring an ^ * e ™^ stei ot (a, 

cated server, the steps of the method be.ng performed by a data ^^J"? ™~ ^ b ^ ^ tne plu- 
providing a pluraiity of peer clients attached to me a^a % ^S£i a untque iden- 

:™th« 

the Dluralitv of peer dients according to the entry for the data package. 
« SS A^nativety. the list of data packages is stored on the extern* I serve, . 
Si Ac^mingtoprefenedent^ime^of^ 

the first peer cHent Preferably, if alternately the *^^^££?a *™ message far the data package by 
the first peer cUent the method further «*"^ * f S is receded by the first peer dient 

client . -^« a « a a «i the response message are transmitted to the plurality of peer clients 

[001 0] Preferably, the request message and the r ^ nse ™^ 9 ° * _ , re transn1 rtted to the plurality of peer 
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the first peer client to the at least one other peer client substantially before the first peer client obtains the data package 
from the external server. More preferably, the list of data packages is stored on each of the plurality of peer dients. and 
the method further comprises the steps of: (g). receiving the response message from the first peer client by the at least 
one other peer client; and (h) altering the list of data packages being stored by the at least one other peer client for indi- 

5 eating the location of the data package according to the response message. 

[001 2] Alternatively, the list of data packages is stored on each of the plurality of peer clients, and the method fur- 
ther comprises the steps of: (g) receiving the response message from the first peer client by the at least one other peer 
client; and (h) altering the list of data packages being stored by the at least one other peer client for indicating the loca- 
tion of the data package according to a probabilistic function. 

io [001 3] Preferably, the probabilistic function is performed according to a set of equations: 



13 



New location = ^ 



Old location 



Po(x) = l/(generation+l) 



New location Pn(x) = 1- l/(generation+ 1) 



wherein Pn(x) is a probability that the new location is substituted for the old location. Po(x) is a probability that the old 
location is retained, and "generation" indicates how many times the location had been previously changed. 
[0014] Also preferably, an upper limit is predetermined for a number of the plurality of peer clients served substan- 
tially simultaneously by the at least one other peer client, such that if a number of the plurality of peer clients served 
25 substantially simultaneously by the at least one other peer client is greater than the upper limit, the method further com- 
prises the step of: (d) sending a busy message from the at least one other peer client to the first peer client. 
[001 5] Preferably, the external server is a Web server, and the plurality of peer clients is a plurality of Web browsers. 
[0016] Also preferably, the external sewer is a BackWeb™ server, and the plurality of peer clients is a plurality of 
BackWeb™ clients. 

30 [001 7] Preferably, the unique identifier for the data package is an MD5 digest of the data package. 

[001 8] ■- According to still other preferred embodiments of the present invention, the step of retrieving the data pack- 
age is performed according to a protocol based on TCP/IP. Preferably, the protocol is HTTP. Alternatively and preferably, 
the protocol is FTP. 

[001 9] Hereinafter, the term "protocol based on TCP/IP" includes any such protocol, including but not limited to the 
33 HTTP (hypertext transfer protocol) and FTP (file transfer protocol) protocols. 

[0020] Hereinafter, the term "data package" refers to any discrete, identifiable unit of data, including but not limited 
to documents, images, messages, data packages or any other type of data. 

[0021 ] Hereinafter, the term "computing platform" refers to a particular computer hardware system or to a particular 
software operating system. Examples of such hardware systems include, but are not limited to, personal computers 

40 (PC), Apple Macintosh ""computers, mainframes, minicomputers and workstations, which are also non-limiting exam- 
pies of data processors for operating a software application under an operating system. Examples of such software 
operating systems include, but are not limited to, UNIX. VMS, Unux, MacOS™, DOS, one of the Windows™ operating 
systems by Microsoft Corp. (USA), including Windows NT™, Windows 3.x™ (in which "x" is a version number, such as 
"Windows 3.1™"). Windows95™ and Windows98™ . 

45 [0022] For the present invention, a software application could be written in a substantially suitable programming 
language, which could easily be selected by one of ordinary skill in the art The programming language chosen should 
be compatible with the operating system according to which the software application is executed. Examples of suitable 
programming languages include, but are not limited to, C. C++ and Java. 
[0023] Hereinafter, the term "broadcast* may also include "multicast" as well. 

so [0024] The invention is herein described, by way of example only, with reference to the accompanying drawings, 
wherein: 

FIGS. 1 A and 1 B are schematic block diagrams of an exemplary basic system and method according to the present 
invention; 

55 FIGS. 2A-2E are schematic block diagrams of an exemplary request/response protocol and method according to 
the present invention; 

FIG. 3 is a schematic block diagram of an exemplary preferred data-flow diagram according to the present inven- 
tion; 
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FIG 4 is a flowchart of a method for operating the system of the present invention with Web browsers; and 
FIGS. 5A and 56 are exemplary request and response messages according to the present inventoa 

' _ • i= n f 3 *i<rtrihuted client-based data caching system and method, which enable data 

across their local network neighborhood, for example wraiin a uw, ™J*" * architectures are well suited 

server. Furthermore, the network traffic is not ^^^^^^^^uS switching hubs, in 
tor peer-to-peer communications Most other nodes on the net- 

loads to the traffic on the network itself. ;« th» art whenever a client requires a data 

[0027, ^rcurrent.yava« r ec = 

t»n retries the data package from **WW**J*v inventjon ^ an intermediate step. For the present 

^^^^ 

rSn^^olod" before resorting to retried • ^ Qnce a dierrt 

requ-esadaupaduge. queries all the hosts, "^^^^^^^f^f^^m^^ 

tUsdatapackagetrom me peer MMjM ££25,. aching system according to the present 
!r^ e b^ 

p.Lrty o. peer cf,ems 12 ^.^.^^T^S? * « shown for L purples 
line in Figure 1A. Two peer d.ents 12. labeled as * 12 js als0 connected to an external 

of il.ustrat.on only and ^^^^Si^^ Server 16 is shown, a pluralfty of exter- 
server 1 6 of some type by an ^^^^"^^M^Scated server in the sense that this server has a pn- 

ent 12 directly to server 16 (not shown). . . nfl ic Hhlstra ted with reference also to Figure 1B. 

« [0032, ^eo^rauonotsystemioacc^n^ 

in step i. peer client 12. such as peer client 12 looks for a ff^ d ^ , "^ e „ cne ^ 2. peer client 12 que- 
ticu.a?peerdiemi2.«thedesiredda*^^ 

ries any other peer clientfs) 12 on local ne^rk 14 toj Jeterm ne peer dient 22 has the desired 
package. For example, peer diem 20 could query peer d-ent 2* to deter™ • obtains the data package 

<5 data package. In step 3a. if peer dient 22 has the desired data pa«*age. then 

,rom peer client 22. Alternatively, as shown ,n step ^^'^^^on 18. Thus; every peer client 12 * 
peer client 20 obtains the data package from server16 conned ^.^na, server" to~ 

also potentially a server which is internal to local network 14. and hence could descr 

50 ?sr h ^:r.iS^^ 

ing to other peer dients 1 2 as "cached data" £> <**2Tol' me ^ men6on can be implemented, of 

[00341 A number of different possfcle embodiments of ™ b^Tsriefly Figures 2A-2D illustrate an 
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are shown for the purposes of discussion only and without intending to be limiting in anyway. For the purposes of dis- 
cussion only, suppose a peer client 26. labeled "A", wishes to obtain four data packages "W, "X', "Y" and "Z" None of 
these data packages are locai to peer client 26. which must therefore obtain these data packages from either another 
peer client 12 as an internal server, or from an externa) server (not shown). Local area network 24 features two other 
peer clients 1 2: peer client 28, labeled "B", and peer client 30. labeled "C". Peer client 26 must therefore first communi- 
cate a request to peer client 28 and peer client 30 to see if the desired data packages are available at either location 
and then peer client 26 must obtain these data packages from peer client 28 or peer client 30 if the desired data pack- 
ages are available. 

[0036] Preferably, two protocols are used for communication between peer clients on a local area network (LAN) a 
data package-exchange protocol and a control protocol. Specifically, the data package exchange protocol is used to 
transfer data packages between peer clients, once the desired data package has been located, and is described in 
greater detail with respect to Figure 2B below. The control protocol enables each peer client to efficiently build and 
maintain tables which describe the location of available data packages across the local area network by exchanging 
messages. 

[0037] Each peer client maintains two hash-tables which contain information about data package location: a local- 
data packages table and a networ k-data packages table. The local-data packages table is a hash-table of data pack- 
ages which reside on the storage medium or media of the peer client itself. The network-data packages table is a hash- 
table of data packages which reside on the storage medium or media of other clients on the local network. This table 
contains the local area network address of the peer client on which each data package is being stored. The size of this 
hash-table is preferably limited in order to reduce memory consumption. More preferably, each entry in the table has a 
time-stamp, such that older entries are purged when the size of the table exceeds the upper permissible limit 
[0038] In order to effectively identify the desired data package, preferably each data package has unique identifier 
or "fingerprint" associated with it More preferably, this unique identifier is an MD5 digest of the content of the data pack- 
age (for a description of the MD5 specification, which is an industry standard and would therefore be obvious to one of 
ordinary skill in the art. see "RFC 1321" at http://dsJntemic.net/rfc/rfc1321 .txt). 

[0039] Once any peer client knows both the unique identifier and the location of the data package on the local net- 
work, that ciient can then proceed to download the data package. However, the peer ciient may not know the location 
of the desired data package, in which case the client must follow a control protocol according to the present invention 
in order to determine the location of the desired data package and to enable the client to build these hash tables with 
respect to future attempts to locate a data package. 

[0040] The control protocol is used to provide each client with knowledge about the locations of data packages 
across the local network. In the preferred implementation illustrated with respect to Figures 2A-2D, control messages 
are preferably sent and received as broadcast or multicast packets. Local area networks such as Ethernet networks 
support broadcast or multicast packets such that all peer clients on a local area network receive the broadcast or mul- 
ticast packets. Effectively, a single packet can be sent to all peer clients by using broadcast or multicast, thereby reduc- 
ing the amount of traffic on the network required as a result of transmitting the request message (see for example 
Chapter 12. "Broadcasting and Multicasting", of TCP/IP Illustrated Volume, by W. Richard Stevens, Addison- Wesley. 
1994). However, optionally the system of the present invention could poll each peer client individually with a control 
message for that peer client, although this is not preferred since such individually addressed messages would consume 
excessive amounts of available bandwidth. In such a situation, preferably polling would be restricted to a certain group 
of peer clients as internal servers, in order to reduce the amount of traffic on the local area network. 
[0041] For the preferred implementation in which broadcast or multicast is used, more preferably, the decision to 
select either IP multicast or broadcast is made accorcfing to the configuration set by the network administrator for the 
local area network. IP multicast is preferable in terms of load on the clients of the local network, but may not be sup- 
ported on all platforms (operating systems). More preferably, the TTL or Time to Live may be configured. The TTL spec- 
ifies the number of routers a packet can cross before being dropped. Configuring the TTL enables data package sharing 
to be expanded across subnet boundaries. 

[0042] As shown with respect to Figure 2B. the control protocol of the present invention preferably operates as fol- 
lows. In step 1. peer client* A" from Figure 2A looks for a data package on the local storage medium or media. In step 
2, since the data package was not found locally on the medium or media of peer client "A", peer client "A" must down- 
load the data package and therefore preferably murticasts (or alternatively broadcasts) a request messaga A request 
message preferably contains a protocol identifying version number (PVN) for the control protocol of the present inven- 
tion and a list of M05 digests of the needed data packages, as shown in Figure 2C. 

[0043] Optionally and preferably, if more than one data package is desired, a list of requested data packages is 
included in the request message rather than a single MD5 digest, in order to reduce the total number of request mes- 
sages on the network. 

[0044] In step 3. the neighboring clients, shown as peer clients "B" and "C" in Figure 2A, receive this request mes- 
sage and search for the requested data package in their local-data packages hash-table. A peer client which does not 
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, ,i. , h„« not rmiv as shown in step 4a Otherwise, in step 4b the peer client sends a response 
''nathedatapac^el^ 

r^T 96 ; £E£ SSSibute the response message if another client responded pre.ioos.yjn 
^TS^SS^Sc on the .oca area networK Also more prefer**, the peer dient disputes the 

IST ™r:Z*T^^2K if peer dient requests a data parage "W, peer client wou. 

would not reply with a response message, ance peer cl ent C does notl mm b oat a pa : g 

pose cuent "A" requests data packages x."Y ^^J^^r^S?^ Suppose another client "C- 

were found and a TCP ^^^^^ Mern ^ ly ^ e response message optionally contains other .ndi- 
ing peer dient « warting tordatepad^e "^.^"JJ data ^a ges from the responding peer. Preferably. 

peer dient as an ^ 2 J^SS5SwU«» communication protocol, including but not lim- 
XT^ToS^fc^ ^ext Tester Protoco, - HTTP/1. 1", available from 

SedienTtofJ^ 

several peer dients need ^^J^^^^'^T^ since none of these clients _ 
abte scenario for t^^^^S^M S would still be downtoading the data package when 

the present invenoon. first dient is downloading the data pack- 

40 [0050] Preferably, the problem is solved by ™W m, 3 ™™» ° oacjace is not vet complete, in this preferred 

^^^^^^^^~«^«'»'- — - 

« tion is described in greater detail with regard to Figure 2E. „ efefab iv bv broadcasting or multicasting. 

[0051] in step 1 . me requesting client again transmits ^^^f^^ e ^ n ^J e dient transmits a 
and then warts for a '^^^^^^ SEIL the^data package, or in 

" IS -tep.^cli^aeatean^in.^ 

the client which will be serving the data package. Thus, preferably only a single client access 

any given data package. i« received indicating tJie location of only some 

[0053] If a request is sent for multiple ^^^^f^^^SSS packages from the neigh- 

55 ofthedatap*d«gesatane,ghborir^^ 

boring peer client or clients. Next the dient then »"^^2^«^2^sen«r. Thus, the client on.y obtains 
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the data packages from the external server, thereby reducing network traffic. 

[0054] According to preferred embodiments of the present invention, preferably the process of downloading data 
package from peer clients is optimized to reduce the amount of time required for downloading, the load on each indi- 
vidual client and the overall network traffic. Such optimization is performed as follows. 

5 [0055] First, preferably the exit degree of each client is bound, such that each client is only able to serve a fixed, 
limited number of other clients simultaneously. More preferably, the default limit is three other clients, for example, or 
some another appropriate number which is preferably configured by the user or by the network administrator. If an addi- 
tional client attempts to download a data package from a client which is already serving the maximum number of other 
clients will receive a "busy* message. This feature limits the load on each individual client. 

io [0056] Also preferably, the present invention is able to optimize the selection of the best client from which the data 
package should be obtained. For example, if client "A* had already downloaded a larger portion of the required data 
package than client "B", transferring the data package from client "A" is more optimal. Such clients are preferentially 
selected to serve data packages, since these clients will be able to serve the data package after a shorter time period 
has elapsed. Such preferential selection occurs by shortening the time period for waiting before these clients respond, 

is thereby increasing the likelihood that they will serve the data packages. For this reason, the client preferably calculates 
the random delay before respondng such that the delay is inversely proportional to the percentage of the data package 
which has been already downloaded. In addition, the random delay is preferably proportional to the number of clients 
being served at the moment in order to decrease the likelihood of overloading already busy clients. 
[0057] In addition, according to other preferred embodiments of the present invention, preferably the entries of the 

20 locations of data packages in the network data packages table are updated according to a probabilistic function. Such 
a function is preferred in order to prevent all of the clients from registering a single client as the server for any particular 
data package, for example. When different clients respond, usually at different times, indicating they have a specific 
data package, the remaining clients listening across the network update the entry for this data package in their network 
data packages table, by adding the IP address, or some other type of address according to the addressing system 

25 employed by the network, of the client which can serve the data package to this table. In a simple implementation, the 
clients would store only the last advertised location of each data package, and therefore many or all clients might 
attempt to obtain the data package from a single client as the internal server, thereby overloading that client 
[0058] To avoid this situation, preferably the following probabilistic algorithm is used to determine the particular cli- 
ent address which is stored in the network data packages table. Each time a new client transmits a response message, 

so indicating that this client is able to serve a particular data package, the probability that the new IP address of the new 
client is substituted for the old IP address is calculated according to the following equations: 
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New IP address = < 



Old IP address Po(x) = l/(generation+ i) 



New IP address Pn(x) = l-i/(generation+l) 



wherein Pn(x) is the probability that anew IP address is substituted for the old IP address, Po(x) is the probability that 
the old IP address is retained, and "generation" is a number indicating how many times this address had been previ- 
ously changed. 

45 [0059] For example, if client *A" responds indicating it has data package "X", then initially all other peer clients store 
the IP address of client "A" as the location of data package X. If client "B" then broadcasts a response also indicating - 
that client "3" has data package "X", then the probability that any one client now changes the IP address for the location 
of data package "X" is 50%. In other words, about half of the clients should now point to client "A" and about half should 
point to dient 

so [0060] Such a substantially even distribution of load across multiple clients should produce data-flow with a tree- 
shaped topology, as shown in Figure 3. rather than a random topology, thus optimizing the average download time and 
the load on the serving clients. 

[0061] Furthermore, if any client requests a particular data package during the period required by client" A" for 
downloading that package, preferably client "A" sends a broadcast or multicast message indicating that the package is 
55 in the process of being downloaded. Therefore, preferably only a single client "B" polls client "A" for each data package, 
for example. Other clients preferably automatically receive any responses from that polling action though the broadcast 
or multicast transmission, and thus will not be forced to poll for themselves. 

[0062] The polling (request/response) traffic is optimized since there is usually no need to transmit both a request 
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• ^ h**. nackaae needed by each client. Such optimization is possible since each client preferably 

Ls a timer, for detection of an aborted Mr or. J^^* p^y repeats the whole 
packages directly from the external server. downloaded by a peer 

naJL aLg period of time, the ^^^^^^Sia.^ automatic detection of peer 
10065] ^% to jf^ r S^S« detect the presence of other peer dents on 

clients is supported. Such automatic detection enaaes eawi p» invention is disabled, since the oper- 

the network. If such peer clients are .not four- L pre J^^^^SSIS-^ a data package « no 
ation of the system as described above would only prolong the time penoa requires 

ble by the user or by the network administrator. , nairttlorize d access of local storage media through the 

[0067] Furthermore, in order * Jp-XS^^S^S Sample Preferably only data 
system of ^^Z^^^JS^^^- — ed data package, are 
packages identrfied in the hash tables are »' 0 D ™ such tnat maRcious users prefera- 

preferabfy only data packages which were intended » bsserwdtoj • JJ^JJ^ ^ media of a D eer 

t* cannot use the system of ^^^ e SbyC:C £5T£h as their 1 28*it MD5 digest. 
3 , client. Data packages are more preferably only '^^^irL intended recipient knows this digest. Thus. 

fc^ccom^^^ 

SST 'nsteplofRgure^aWebbrc^erbeingoper^ 
First tleWeb browser looksat^^ 
« in the .oca. cache, then that data package ^^^TT^^^e transmission. The data pack- 
sage requesting this data package, preferably by ^^^S^unSe^dentffier is the URL of the data 

request message ccnturting the 1st ^^^f^^^lS^i an HTML (hypertext mark-up Ian- 
work. Such a situation may anse ,f. for example, the ^ browser* ac J"^" 5 request message 

guage) document, or Web page, which contains ^ !^(?VNH =or instance: -V1.0". As shown 
cordis an identifying 'magic to identify the data package or 

nSsTges. and a unique identifier aentfying ^^^^^ueS messages of this type. These Web 



8 



EP 0 993 163 A1 



browser even if the requested URL is still being downloaded by that Web browser. 

[0072] In step 3. if no response to an issued request message is received within a certain amount of time, for exam- 
ple 5 seconds, then the process is preferably rimed out. In this case, the Web browser preferably no longer attempts to 
obtain the URL from another Web browser, and the URL is obtained from the regular Web server using regular HTTP 

s protocol. Before starting to download the data package from the regular Web server, the Web browser preferably trans- 
mits a response message indicating that this particular Web browser is downloading the data package. 
[0073] On the other hand, if a response message is received, the Web browser obtains the URL from the other Web 
browser which indicated that it had the URL in the local cache. Preferably, Web browsers across the network record the 
URLs and the address from which the response message originated for future use, such that these Web browsers 

w would be able to download the URL at a future time without first transmitting the request messaga 

[0074] Once the Web browser is able to locate a data package on a neighboring Web browser, the Web browser 
attempts to download the data package The downloading process is performed with a suitable data-transfer protocol, 
such as HTTP or FTP. If a time-out or other failure occurs during the processing of data package transfer, the receiving 
Web browser preferably performs substantially the entire procedure more than once. More preferably, the number of 

is permitted attempts to retry the transfer is configurable. If the process fails after these attempts have been performed, 
preferably the Web browser transfers the required data package or data packages from the regular Web server. 
[0075] According to preferred features of this embodiment of the present invention, data package downloading is 
well distributed, such that the Web browsers do not obtain a data package from only a single Web browser, but rather 
obtain the data package from a plurality of Web browsers. Such distribution is maintained as follows. 

20 [0076] First, preferably the number of simultaneous data package transfers from a single Web browser is limited. If 
this number is exceeded, the Web browser transmits a "busy" message to other Web browsers attempting to transfer 
the data package. Next, preferably once a Web browser receives a message giving the location of a particular data 
package, the corresponding entry in the hash table for that data package is not altered every time another response 
message is received pertaining to this data package. The hash table is preferably altered by subsequent messages in 

25 a probabilistic manner, such that the probability that any particular entry is updated to indicate a new location of a data 
package is equal to 1/(generatior*l). where •generation' counts the number of times a response message was received 
for that data package. 

[0077] For example, if Web browser "A" transmits a response message indicating that data package "X" is on the 
local cache; then initially all of the neighboring Web browsers have an entry in the hash table indicating that Web 
zo browser "A" is the location of data package "X". If Web browser "B" then transmits a response message for data pack- 
age "X", then each Web browser preferably now alters the entry in the hash table to indicate a new location of data 
package "X" with a probability of about fifty percent, such that about fifty percent of the Web browsers now have an 
entry indicating that the data package is available from Web browser "A" and such that about fifty percent of the Web 
browsers now have an entry indicating that the data package is available from Web browser "B". Thus, a good load dis- 
ss tribution can be achieved. 

[0078] The random delay (mentioned in step 2 above) chosen by a browser is proportional to the number of cur- 
rently served browsers, or the number of browsers currently downloading data packages from that browser, and 
inversely proportional to the amount of the data package already downloaded by it. This way the browsers more eligible 
to download from are more likely to be chosen by other browsers to serve these data packages. 
40 [0079] While the invention has been described with respect to a limited number of embodiments, it will be appreci- 
ated that many variations, modifications and other applications of the invention may be made. 

Claims 

45 1 . A method for distributing data packages across a network, the network featuring an external server for serving at 
least one data package, the external server being a dedicated server, the steps of the method being performed by 
a data processor, the method comprising the steps of: 

(a) providing a plurality of peer clients attached to the network and providing a list of data packages, said data 
so packages being stored by each of said plurality of peer clients, each data package of said data packages hav- 
ing an entry in said list, said entry indicating a unique identifier for said data package and a location of said data 
package in at least one of said plurality of peer clients; 

(b) examining said list of data packages by a first peer client to find an entry tar a required data package; and 

(c) if said entry for said data package is present on said list of data packages of said first peer client, retrieving 
55 said data package from said location at another of said plurality of peer clients according to said entry for said 

data package. 

2. The method of claim 1 , wherein said list of data packages is stored on at least said first peer client 
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